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<220> 

<221> DOMAIN 
<222> (1) . . (472) 

<223> kinesin like motor domain 
<220> 

<221> DOMAIN 

<222> (473) . . (2752) 

<223> rod domain 

<220> 

<221> DOMAIN 

<222> (2753) . . (2954) 

<223> tail domain 

<400> 1 

Met Ser Glu Gly Asp Ala Val Lys Val Cys Val Arg Val Arg Pro Leu 
1 5 10 15 

lie Gin Arg Glu Gin Gly Asp Gin Ala Asn Leu Gin Trp Lys Ala Gly 
20 25 30 

Asn Asn Thr He Ser Gin Val Asp Gly Thr Lys Ser Phe Asn Phe Asp 
35 40 45 



Arg Val Phe Asn Ser His Glu Ser Thr Ser Gin He Tyr Gin Glu He 
50 55. 60 



58 



ua Val Pre Ue lie «, Ser Ma Leu «. Gly Tyr Asn «» thr II. 

l M. Tyr «» th. Ser Ser Gly T* Tyr THr Met «et Gly 
8 5 

» riv lie He Pro Gin Ala lie Gin Glu Val Phe 
Thr Pro Asn Ser Leu Gly He He iiQ 



100 



Lys Tie He Gin 01" »• P ro «n «, Glu «- - - «« ^ 
115 

n T 1e Tvr Asn Glu Thr Val Lys Asp Leu Leu Cys Asp Asp 
Tyr Met Glu He Tyr asn uxu i4Q 

130 1J5 

^ w g. «• .i. «, - »p - - « « ?S 

Z V.X »U «P - ~ «» Ifo ^ !« V " 

„. Gin Trp U. Lys Gly OX. Lg - «, KU Tyr G!y Glu THr 

180 . ' ' • La3 

«n-'s*r Ser Arg Ser His Thr He Phe Arg Met lie 
Lys Met Asn Asp His Ser Ser arg 2Q5 

195 

» Ara Asn Asp "Pro Thr Asn Ser Glu Asn Cys. Asp 
Val Glu Ser Arg Asp Arg Asn Asp ^ 

210 



,r i «Pt Val Ser His Leu Asn Leu Val Asp Leu Ala Gly Ser 
Gly Ala Val Met Val ser nx ^ 

« rin Thr Gly Ala Glu Gly Val Arg Leu Lys Glu Gly 
Glu Arg Ala Ser Gin Thr Gly »x ^ 2 55 

245 

c ton Phe lie Leu Gly Gin Val He Lys Lys 
Cys Asn He Asn Arg Ser Leu Phe lie be 27Q 

y 260 ^ 

L eu Ser >sp Gly Gin Gly fy - »« « T - SH S " WS 
275 

r , n Asn S er Leu Gly Gly Asn Ala Lys Thr Val 
Leu Thr Arg He Leu Gin Asn Ser ^ 

290 295 

,r i c^y PhP Asd Glu Thr Leu Ser Tnr 
tie He Cys Thr He Thr Pro Val Ser Phe Asp 32Q 

£\«r Thr Ala Lys His Val Arg Asn Thr Pro His Val 
Leu Gin Phe Ala/Ser Thr Ala by ^ 335 

\ / 325 

^ * ri„ Ala Leu Leu Lys Arg Tyr Arg Lys Glu 

Asn Glu Val Leu Asp Asp Glu Ala Leu b 35Q 
340 

T „, Gln Leu Glu Asn Leu Glu Ser Ser Ser Glu 
He Leu Asp Leu Lys Lys Gin Leu t, 3g5 

355 

^.i n„ Thr Gin Leu Leu Ala 
Thr Lys Ala Gin Ala Met Ala Lys Glu Glu His Thr 

370 ■ 3/:> 



59 



Glu He Lys Gin Leu His Lys Glu Arg Glu Asp Arg He Trp His Leu 
385 390 395 400 

Thr Asn He. Val Val Ala Ser Ser Gin Glu Ser Gin Gin Asp Gin Arg 
405 410 415 

Val Lys Arg Lys Arg Arg Val Thr Trp Ala Pro Gly Lys He Gin Asn 
420 425 * ' 430 

Ser Leu His Ala Ser Gly Val Ser Asp Phe Asp Met Leu Ser Arg Leu 
435 440 445 

Pro Gly Asn Phe Ser Lys Lys Ala Lys Phe Ser Asp Met Pro Ser Phe 
450 455 460 

Pro Glu He Asp Asp Ser Val Cys Thr Glu Phe Ser Asp Phe Asp Asp 
465 470 475 480 

Ala Leu Ser Met Met Asp Ser Asn Gly lie Asp Ala Glu Trp Asn Leu 
485 490 495 

Ala Ser Lys Val Thr His Arg Glu Lys Thr Ser Leu His Gin Ser Met 

500 . " 505 510 

lie Asp Phe Gly Gin lie Ser Asp Ser Val Gin Phe His Asp Ser Ser 
515 520 525 

i 

Lys Glu Asn Gin Leu Gin Tyr Leu Pro Lys Asp Ser Gly Asp Met Ala 
530 535 540 

Glu Cys Arg Lys Ala Ser Phe Glu Lys Glu He Thr Ser Leu Gin Gin 
545 550 555 560 

Gin Leu Gin Ser Lys Glu Glu Glu Lys Lys Glu Leu Val Gin Ser Phe 
565 570 575 

Glu Leu Lys He Ala Glu Leu Glu Glu Gin Leu Ser Val Lys Ala Lys 
580 585 590 

Asn Leu Glu Met Val Thr Asn Ser Arg Glu His Ser He Asn Ala Glu 
595 600 605 

Val Gin Thr Asp Val Glu Lys Glu Val Val Arg Lys Glu Met Ser Val 
610 615 620 

Leu Gly Asp Ser Gly Tyr Asn Ala Ser Asn Ser Asp Leu Gin Asp Ser 
625 630 635 640 

Ser Val Asp Gly Lys Arg Leu Ser Ser Ser His Asp Glu Cys He Glu 
645 650 ' 655 

His Arg Lys Met Leu Glu Gin Lys He Val Asp Leu Glu Glu Phe He 
660 665 670 

Glu Asn Leu Asn Lys Lys Ser Glu Asn Asp Lys Gin Lys Ser Ser Glu 
675 680 685 



Gin Asp Phe Met Glu Ser He Gin Leu Cys Glu Ala He Met Ala Glu 
690 695 700 



i 
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Lys Ala Asn Ala Leu Glu Glu Leu Ala Leu Met Arg Asp Asn Phe Asp 
705 710 715 * 720 

Asn lie lie Leu Glu Asn Glu Thr Leu Lys Arg Glu lie Ala Asp Leu 
725 730 735 

Glu Arg Ser Leu Lys Glu Asn Gin Glu Thr Asn Glu Phe Glu lie Leu 
740 745 750 

Glu Lys Glu Thr Gin Lys Glu His Glu Ala Gin Leu He His Glu He 
755 *760 765 

Gly Ser Leu Lys Lys Leu Val Glu Asn Ala Glu Met Tyr Asn Gin Asn 
770 775 780 

Leu Glu Glu Asp Leu Glu Thr Lys Thr Lys Leu Leu Lys Glu Gin Glu 
785 790 795 800 

He Gin Leu Ala Glu Leu Arg Lys. Arg Ala Asp Asn Leu Gin Lys Lys 
805 810 • 815 

Val Arg Asn Phe Asp Leu Ser Val Ser Met Gly Asp Ser Glu Lys Leu 
820 — 825 830 

Cys Glu Glu He Phe Gin Leu Lys Gin Ser Leu Ser Asp Ala Glu Ala 
835 840 845 

Val Thr Arg Asp Ala Gin Lys Glu Cys Ser Phe Leu Arg Ser Glu Asn 
850 855 860 

Leu Glu Leu Lys Glu Lys Met Glu Asp Thr Ser Asn Trp Tyr Asn Gin 
865 870 875 880 

Lys Glu Lys Ala Ala Ser Leu Phe Glu Lys Gin Leu Glu Thr Glu Lys 
885 890 895 

Ser Asn Tyr Lys Lys Met Glu Ala Asp Leu Gin Lys Glu Leu Gin Ser 
900 905 910 

Ala Phe Asn Glu He Asn Tyr Leu Asn Gly Leu Leu Ala Gly Lys Val 
915 920 925 

Pro Arg Asp Leu Leu Ser Arg Val Glu Leu Glu Lys Lys Val Ser Glu 
930 935 940 

Phe Ser Lys Gin Leu Glu Lys Ala Leu Glu Glu Lys Asn Ala Leu Glu 
945 " 950 955 960 

Asn Glu Val Thr Cys Leu Ser Glu Tyr Lys Phe Leu Pro Asn Glu Val 
965 970 975 

Glu Cys Leu Lys Asn Gin He Ser Lys Ala Ser Glu Glu He Met Leu 
980 985 990 

Leu Lys Gin Glu Gly Glu His Ser Ala Ser He He Ser Lys Gin Glu 
995 1000 1005 



He He Met Gin Glu Gin Ser Glu Gin lie Leu Gin Leu Thr Asp Glu 
1010 1015 1020 
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Val Thr His Thr Ginger Lys Val Gin Gln^hr Glu Glu Gin Tyr^eu 

Glu Met Lys Lys Met His Asp Asp Leu Phe Glu Lys Tyr lie Arg Asn 
1045 10bU 

Lys ser Glu Ala Glu Asp Leu Leu Arg Glu Met Glu Asn Leu Lys Gly 
1060 1065 

Thr Met Glu Ser Val Glu Val Lys lie Ala Asp Thr Lys His Glu Leu 

1075 1080 
Glu Glu Thr lie Arg Asp^y. Glu Gin Leu Leu^is Glu Lys Lys Tyr 



1105 



1090 

Phe Phe Gin Ala Met Gin Thr He Phe Pro lie Thr Pro Leu Ser Asp 

1110 iiJ - D 



Ser Leu Pro Pro Ser Lys Leu Val Glu Gly Asn Ser Gin Asp Pro He 

1125 1130 
Glu lie Asn Asp Tyr His Asn Leu lie Ala Leu Ala Thr Glu Arg Asn 

H40 ll45 

Asn lie Met Val Cys Leu Glu Thr Glu Arg Asn Ser Leu Lys Glu Gin 

1155 1160 
Val He Asp Leu Asn Thr Gin Leu Wn Ser Leu Gin Ma Gin Ser He 

1170 1175 
Glu Lys Ser Asp Leu Gin Lys Pro Lys Gin Asp Leu Glu Glu Gly Glu 
1185 1190 1195 

Val Lys Leu Leu Leu Glu Met Glu Leu^eu Lys Gly His Leu^hr Asp 



1205 



ser Gin Leu Ser He Glu Lye L» «. Leu Glu Asn Leu^lu val Thr 



1220 



Glu Lys Leu Gin Thr Leu GlnGlu Glu Met Lys AsnHe Thr lie Glu 



1235 



Arg Asn Glu Leu Gin Thr Asn Phe Glu Asp Leu Lys Ala Glu His Asp 



1250 



1255 



Ser Leu Lys Gin Asp Leu Ser Glu Asn Ile^lu Gin Ser lie GluThr 



1265 



1270 



Gl„ Asp Glu Leu Ar, Ala Ala Gin Glu Glu Leu Ar g Glu Gin Lys Gin 
1285 lzyu 

L eu val Asp ser Phe Ar„ Gin Gin Leu Leu Asp Cys Ser Val Gly IXe 

1300 130:5 
ser ser Pro Asn His Asp Ala Val Ala Asn Gin GLu g. Val Ser Leu 

1315 13/0 
Gly Glu Val Asn Ser Leu Gin Ser Glu Met Leu Arg Gly Glu Arg Asp 
1330 1335 
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Glu Leu Gin Thr Ser Cys Lys Ala Leu Val Ser Giu Leu Glu Leu Leu 
1345 1350 1355 1360 

Arg Ala His Val Lys Ser Val Glu Gly Glu Asn Leu Glu lie Thr Lys 
1365 1370 1375 

Lys Leu Asn Gly Leu Glu Lys Glu lie Leu Gly Lys Ser Glu Glu Ser 
1380 1385 1390 

Glu Val Leu Lys Ser Met Leu Glu Asn Leu Lys Glu Asp Asn Asn Lys 
1395 1400 1405 

Leu Lys Glu Gin Ala Glu Glu Tyr Ser Ser Lys Glu Asn Gin Phe Ser 
1410 1415 ' 1420 

Leu Glu Glu Val Phe Ser Gly Ser Gin Lys Leu Val Asp Glu lie Glu 
1425 1430 1435 1440 

Val Leu Lys Ala Gin Leu Lys Ala Ala Glu Glu Arg Leu Glu lie Lys 
1445 : 1450 1455 

Asp Arg Asp Tyr Phe Glu Leu Val Gin Thr Ala Asn Thr Asn Leu Val 
1460 1465 1470 

Glu Gly Lys Leu Glu Thr Pro Leu Gin Ala Asp His Glu Glu Asp Ser 
1475 1480 1485 

He Asp Arg Arg Ser Glu Glu Met *Glu He Lys Val Leu Gly Glu Lys 
1490 " 1495 1500 

Leu Glu Arg Asn Gin Tyr Leu Leu Glu Arg Leu Gin Glu Glu Lys Leu 
1505 1510 1515 1520 

Glu Leu Ser Asn Lys Leu Glu lie Leu Gin Lys Glu Met Glu Thr Ser 
1525 1530 1535 

Val Leu Leu Lys Asp Asp Leu- Gin Gin Lys Leu Glu Ser Leu Leu Ser 
1540 1545 1550 

Glu Asn He He Leu Lys Glu Asn He Asp Thr Thr Leu Lys His His 
1555 1560 1565 

Ser Asp Thr Gin Ala Gin Leu Gin Lys Thr Gin Gin Glu Leu Gin Leu 
1570 1575 1580 

Ala Lys Asn Leu Ala He Ala Ala Ser Asp Asn Cys Pro He Thr Gin 
1585 1590 1595 1600 

Glu Lys Glu Thr Ser Ala Asp Cys Val His Pro Leu Glu Glu Lys He 
1605 1610 1615 

Leu Leu Leu Thr Glu Glu Leu His Gin Lys Thr Asn Glu Gin Glu Lys 
1620 1625 1630 

Leu Leu His Glu Lys Asn Glu Leu Glu Gin Ala Gin Val Glu Leu Lys 
1635 1640 1645 

Cys Glu Val Glu His Leu Met Lys Ser Met He Glu Ser Lys Ser Ser 
1650 1655 1660 
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. ~, T„e Hi«s Asd Thr Glu Gin Gin Leu Leu 
Leu Glu Ser Leu Gin His Glu Lys His A.p «£ ^ 

1665 1670 

t rin Gin Met Gin Val Val Thr Gin Glu Lys Lys Glu Leu 
Ala Leu Ly 3 . 01 "^ 1690 1695 

• uie Le u Thr Ala Glu Val Asp His Leu Lys Glu 

Gin Gin Thr His Glu His Leu TW fti 1?10 



1700 



Rs „ „.«. leu Gly leu »« £ «*■ - «• «» ^ 

Thr Cln leu £u *sn Glu *s„ «. - Glu «« 

1730 WJ3 



^ Leu Gla^s Glu X- »- - «" S " ^ 

Leu AsnGln Glu Met Glu Met V,, Met l.u Glu 

1780 . - .< 10 

= rm Ara Thr Val He Ala Glu Arg Asp Gin Leu Gin 
Leu Lys Asn Ser Gin Arg ™r 18Q5 



1795 

e*r Val Glu Met Ser lie Glu Thr Gin A'sp Asp 
Asp Asp Leu Arg Glu Ser Val Glu ^ 

1810 



le u «, .,3 M. Oin 01. U. Leu «!» Gln^ln Ly* »P Lys -1™ 
1825 1830 

G1 » tb. Se^Gln lie Ser val Leu^lu Glu Lye He s«Leu leu 
Clu « Ginjru Tyr »» Th. V,l Lye Glu «r L.u Ser 

t a «n Gin Ser Lys Gin His Leu Phe Ser Glu lie 
Glu Arg Asp Asp Leu Asn Gin Ser uy ^ 

1875 iHB 



te u set Leu S., Leu Lye Glu Lye Glu^e Me leu Glu =1. 
" 1890 1895 

R l a Glu Lye >ep Lye Ale «P * ~ °* ™ '"iSS 



1905 1910 
alu Lys He He G!u Clu Gln^eu Leu - - -~ 

Leu Lys «„ J~ «» set Leu Ile Gln "K Lys Glu 

Oin leu AleTu «. », ^ Hie leu clu - 

1955 . i3D 

, t rlv lvs Met Glu Gin Glu Arg Asp Glu Ala Ala Asn 
Asp Leu Ala Leu Gly Lys Met o ^ Q 

1970 
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Lys Val II. Ala Leo Thr Glu Lys Het Ser Jjr Leu Glu Glu Stall. 
1985 1990 

Asn Glu Asn vel Thr Thr Leu Lys Glu SI, Glu Gly Glu Lys^lu Thr 
2005 ZU1U 

m r rm Ara Pro Ser Lys Gin Gin Ser Ser Ser Gin Met Glu 
Phe Tyr Leu Gin Arg Pro ber 2030 

2020 202:> 



Glu Leu Arg Glu Ser Leu Lys Thr Lys Asp Leu Gin Leu Glu Glu Ala 

2035 2040 
Glu Lys Glu lie ser Glu Ala Thr Asn Glu He Lys Asn Leu Thr Ala 

2050 2055 
L ys lie ser ser Leu Glu Glu Glu lie Leu Gin As„ Ala Ser 1UU» 



2050 

c^r- t.«u Glu Glu Glu He Lex. 
2065 "70 

* s „ Glu Ala val Ser Glu Ar g Glu Asn Leu Ar, His Ser I*. Gin Gin 



2085 2090 



teu val ser Glu Leu Glu Gl„ Leu Ser Leu Thr Leu Lys Ser Ar g Asp 

2100 21Ub 
His Ala P he Ala Gin Ser Lys AT, Glu Lys Asp Glu Al, Val Asn Lys 

2115 ^ 1ZU 
He Ala ser Leu Ala Glu Glu He Vys lie Leu Thr Lys Glu Met Asp 
oi -in - 2135 



Gin Glu Gin Ser Ser His Leu 
2155 2160 

T«r Tvs Thr Glu Leu Gin Met Leu Lys Gin 
Ser Glu Glu Leu Cys Thr Tyr Lys Thr fcxu 



Glu Phe Arg Asp Ser Lys Glu Ser Leu Gin Gl^ ^ 
2145 2150 



2165 2170 



Gin Lys Glu Asp He Asn Asn Lys Leu Ala Glu Lys VI g. Glu Val 

2180 * L *° 
ASp Glu Leu Leu Gin His Leu Ser Ser Leu Lys Glu Gin Leu Asp Gin 



2195 2200 



He Gin Met Glu Leu Arg^Asn Glu Lys Leu Ar^Asn Tyr Glu Leu Cys 
2210 

Glu Lys Met Asp lie Met Glu Lys Glu lie ser val Leu Ar g Leu^.t 



2215 

Lys Met Asp ne n«u Glu ^ 

2230 

Gin Asn Glu Pro Gin Gin Glu Glu Asp Asp val Ala Glu «, Njt Asp 

2245 2250 
He Leu Glu Ser Ar g Asn Gin Glu lie Gin Glu Leu Met Glu Lys He 

2260 225b 
ser Ala Val Tyr S.r Glu Gin His Thr Leu Leu S.r Ser Leu Ser Ser 

2275 2280 
Glu Leu Gin Lys Glu Thr Glu Ala His Lys His Cys Met Leu Asn He 



2290 



2295 
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Lys Glu Ser Leu Ser Ser Thr Leu Ser Arg Ser Phe Gly Ser Leu Gin 
2305 2310 2315 2320 

Thr Glu His Val Lys Leu Asn Thr Gin Leu Gin Thr Leu Leu Asn Lys 
2325 2330 2335 

Phe Lys Val Val Tyr Arg Thr Ala Ala Val Lys Glu Asp His Ser Leu 
2340 2345 2350 

lie Lys Asp Tyr Glu Lys Asp Leu Ala Ala Glu Gin Lys Arg His Asp 
2355 2360 2365 

Glu Leu Arg Leu Gin Leu Gin Cys Leu Glu Gin His Gly Arg Lys Trp 
2370 2375 2380 

Ser Asp Ser Ala Ser Glu Glu Leu Lys Phe Cys Glu lie Glu Phe Leu 
2385 2390 2395 2400 

Asn Glu Leu Leu Phe Lys Lys Ala Asn lie lie Gin Ser Val Gin Asp 
2405 2410 2415 

Asp Phe Ser Glu Val Gin Val Phe Leu Asn Gin Val Gly Ser Thr Leu 
2420 . , - 2425 2430 

Gin Glu Glu Leu Glu His Lys Lys Gly Phe Met Gin Trp Leu Glu Glu 
2435 2440 2445 

Phe Gly Asp Leu His Val Asp Ala Lys Lys Leu Ser Glu Gly Met Gin 
2450 2455 2460 

Gin Glu Asn Arg Arg lie Ala Ser Thr lie Gin Leu Leu Thr Lys Arg 
2465 2470 2475 2480 

Leu Lys Ala Val Val Gin Ser Lys He Gin Arg Glu He Thr Val Tyr 
2485 2490 2495 

Leu Asn Gin Phe Glu Ala Lys Leu Gin Glu Lys Lys Glu Gin Asn Lys 
2500 ** 2505 2510 

Glu Leu Met Arg Arg Met Glu His His Gly Pro Ser Ala Ser Val Met 
2515 2520 2525 

Glu Glu Glu Asn Ala Arg Leu Leu Gly He Leu Lys Thr Val Gin Asp 
2530 2535 2540 

Glu Ser Lys Lys Leu Gin Ser Arg He Lys Met Leu Glu Asn Glu Leu 
2545 " * 2550 2555 2560 

Asn Leu Val Lys Asp Asp Ala Met His Lys Gly Glu Lys Val Ala lie 
2565 2570 2575 

Leu Gin Asp Lys Leu Leu Ser Arg Asn Ala Glu Ala Glu Leu Asn Ala 
2580 2585 2590 

Met Gin Val Lys Leu Thr Lys Lys Gin Asp Asn Leu Gin Ala Ala Met 
2595 " 2600 2605 

Lys Glu He Glu Asn Leu Gin Lys Met Val Ala Lys Gly Ala Val Pro 
2610 2615 2620 
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Tyr Lys Glu Glu lie Asp Asn Leu "Lys Thr Lys Val Val Lys lie Glu 
2625 2630 2635 2640 

Met Glu Lys lie Lys Tyr Ser Lys Ala Thr Asp Gin Glu lie Ala Tyr 
2645 2650 2655 

Leu Lys Ser Cys Leu Glu Asp Lys Glu Glu Gly Leu Arg Arg Leu Lys 
2660 2665 2670 

Glu Glu Leu Arg Arg Ala Gin Ala Asp Asn Asp Thr Thr Val Cys Val 
2675 " 2680 2685 

Pro Lys Asp Tyr Gin Lys Ala Ser Thr Phe Pro Val Thr Cys Gly Gly 
2690 ~ 2695 2700 

Gly Ser Gly lie Val Gin Ser Thr Ala Met Leu Val Leu Gin Ser Glu 
2705 2710 2715 2720 

Lys Ala Ala Leu Glu Arg Glu Leu Ser His Tyr Lys Lys Lys Tyr His 
2725 2730 2735 

His Leu Ser Arg Thr Met Ser Ser Ser Glu Asp Arg Lys Lys Thr Lys 
2740 2745 2750 

Ala Lys Ser Asp Ala His Ser Ser His Thr Gly Ser Ser His Arg Gly 
2755 * 2760 2765 

Ser Pro His Lys Thr Glu Thr Tyr *Arg His Gly Pro Val Thr Fro Glu 
2770 2775 2780 

Arg Ser Glu Met Pro Ser Leu His Leu Gly Ser Pro Lys Lys Ser Glu 
2785 2790 2795 2800 

Ser Ser Thr Lys Arg Val Val Ser Pro Asn Arg Ser Glu lie Tyr Ser 
2805 2810 2815 

Gin Leu Val Met Ser Pro Gly Lys Thr Gly Met His Lys His lie Leu 
2820 2825 2830 

Ser Pro Ser Lys Val Gly Leu His Lys Lys Arg Ala Leu Ser Pro Asn 
2835 2840 2845 

Arg Ser Glu Met Pro Thr Gin His Val He Ser Pro Gly Lys Thr Gly 
2850 2855 2860 

Leu His Lys Asn Leu Thr Glu Ser Thr Leu Phe Asp Asn Leu Ser Ser 
2865 2870 2875 2880 

Pro Cys Lys Gin Gin Lys Val Gin Glu Asn Leu Asn Ser Pro Lys Gly 
2885 2890 2895 

Lys Leu Phe Asp Val Lys Ser Lys Ser Met Pro Tyr Cys Pro Ser Gin 
2900 2905 2910 

Phe Phe Asp Asn Ser Lys Leu Gly Asp Phe Ser Glu Leu Asn Thr Ala 
2915 2920 2925 

Glu Ser Asn Asp Lys Ser Gin Ala Glu Asn Trp Trp Tyr Glu Ala Lys 
2930 2935 2940 
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Lys Glu Thr Ala Pro Glu Cys Lys Thr Ser 
2945 2950 



<210> 2 
<211> 9626 
<212> DNA 
<213> Xenopus sp. 

<220> 

<223> XCENP-E nucleotide sequence 

<220> 
<221> CDS 

<222> (143) , . (9007) 
<400> 2 

gaattccgga gtcggatagg ctagtcgggc gagggaaatt caaactggtt atagaagaaa 60 
cttgaaccgc cgccaaaagg gactaaagt.g acagagacag ggagcggtgt cggtaccgat 120 
ttctccacta atcggtctca aaatgtccga gggagatgca gttaaagtgt gtgtgagggt 180 
tcggccgctt atacagagag aacaagggga tcaagccaac ctgcaatgga aggctggaaa 240 
caacaccatt tcccaagttg atgggacaaa gtctttcaat ttcgatcgtg .tatttaattc 300 
tcacgaatca acaagtcaaa -tttaccaaga aatagcagta cctatcatac gatcagcttt 360 
gcagggatat aatggcacaa tatttgcata cggacagaca tcttcaggca agacgtacac 4 20 
aatgatggga acaccaaatt cattgggcat aataccccaa gccatacagg aagtttttaa 480 
aattattcag gagataccga acagagagtt tcttctaaga gtttcttata tggagattta 540 
caatgaaact gtgaaagacc tactgtgtg^ tgacagaaga aagaagccct tggaaattcg 600 
cgaggatttt aatagaaacg tgtatgttgc tgacctgact gaagaacttg ta'atggttcc 660 
tgaacatgta atacagtgga tcaaaaaggg tgaaaaaaac agacattatg gagagactaa 720 
aatgaatgat catagtagtc gttcacatac aatatttaga atgattgttg aaagccgaga 780 
cagaaatgat cccacaaatt cagagaactg tgatggagct gtcatggtat ctcacttgaa 840 
tttggtagat cttgctggca gtgaaagagc aagccaaact ggagctgaag gtgtgagact 900 
taaggaaggc tgcaacatca accgcagctt gtttatcctt ggacaggtta ttaagaagct 960 
tagcgacggc caggctggtg gatttataaa ctacagagac agcaaactca ccagaattct 1020. 
ccaaaattca ttgggaggaa atgctaaaac ggttataatt tgcacaatta cgccagtttc 1080 
ttttgatgag actctaagta cacttcagtt tgccagtact gccaaacatg tgagaaatac 1140 
tccccatgtt aatgaggtcc tggatgatga agcgttgcta aaaaggtaca gaaaggaaat 1200 
cttggattta aagaaacaat tagagaattt agagtcatcg tctgaaacaa aagctcaagc 1260 
aatggctaaa gaagagcata cacagttgct agctgaaatc aaacaactac acaaagagag 1320 
agaagataga atatggcact tgacaaatat tgttgttgct tcatcccaag aatctcaaca 1380 
ggaccaaagg gtcaaacgaa aacgaagagt tacgtgggcg ccaggaaaaa tccaaaatag 1440 
tttacatgct tctggtgttt ctgactttga tatgctatcc agattacctg gcaattttag 1500 
caagaaggca aagttctctg acatgccttc atttccagaa attgatgact ctgtttgtac 1560 
agagttttct gattttgatg acgccctctc catgatggac agcaatggaa tagatgcaga 1620 
atggaattta gccagtaaag taactcacag agaaaagaca tcacttcatc aatcaatgat 1680 
agactttgga cagatttctg acagtgttca gtttcatgat tcttctaagg aaaaccagct 1740 
acaatacctc cccaaagact ctggtgatat ggctgaatgc agaaaagctt cttttgaaaa 1800 
agagatcaca agcctccagc aacaactaca gtcaaaggag gaagaaaaaa aggaacttgt 1860 
acaaagcttc gagctcaaga tagcagaact ggaagagcag cttagtgtca aagctaaaaa 1920 
tctagagatg gttacaaact cgagagagca ttccattaat gctgaagtcc aaacagatgt 1980 
tgaaaaggaa gttgtgagaa aagaaatgtc agtccttgga gactctggtt acaatgcatc 2040 
aaacagtgac ctacaggata gttctgttga tggtaaacgt ctaagcagct cccatgatga 2100 
gtgtatagaa cacagaaaaa tgctggaaca aaagatcgtt gatttagaag agtttattga 2160 
aaaccttaac aagaaaagtg agaatgataa acaaaaatct tctgagcaag attttatgga 2220 
gagtattcag ctatgtgaag ctataatggc agaaaaggca aatgcactgg aggaactggc 2280 
acttatgcga gataattttg acaatattat tctagagaat gaaactctaa aaagggaaat 2340 
tgcagatctg gaacgttcac tcaaggagaa tcaagaaacc aatgagtttg aaattctgga 2400 
gaaggaaact caaaaagaac acgaggcaca actaatccat gagattggca gtttaaagaa 24 60 
attagttgaa aatgcagaga tgtacaatca aaatcttgag gaagatctag aaactaaaac 2520 
aaaacttctg aaagagcagg aaattcaact tgcagaatta aggaaacgcg cagataactt 2580 
gcagaaaaaa gtacgaaatt ttgatctctc ggtttccatg ggtgatagtg agaaactctg 2640 



68 



tgaagaaatc tttcaactga agcaatctct 
tcagaaggaa tgttctttcc tcagaagtga 
cacatcaaac tggtacaatc aaaaagaaaa 
aactgaaaaa tcaaactaca agaaaatgga 
ttttaatgag attaactact taaatggcct 
ttctcgtgtt gaattagaga aaaaggtttc 
ggaagaaaaa aatgccttgg agaatgaagt 
aaatgaagtt gaatgcttga aaaatcagat 
aaagcaagaa ggagaacatt ctgcatctat 
gcaatctgag cagattttac aactgactga 
gcagactgaa gagcaatact tggagatgaa 
tatcagaaac aaaagtgaag ctgaagacct 
tatggagtct gtggaagtaa agattgctga 
ggataaggag caactgcttc atgagaaaaa 
tccgattaca cctctttcag actcgcttcc 
agaccccata gaaatcaatg actaccacaa 
cattatggtg tgtctagaga ctgaaagaaa 
cactcaactt caaagtcttc aagcacaaag 
gcaagacttg gaagaaggag aggttaaact 
cctaactgac tcacagctgt ctatagagaa 
aaaactccaa acacttcaag aagagatgaa 
gaccaacttc gaagacctga aagcagagca 
cattgagcag tcaattgaaa^qacaagatga 
acagaagcaa ctggttgata gctttagaca 
atcaccaaac catgatgcag ttgctaacca 
gttacaaagt gaaatgctgc gtggtgaaag 
agtttcagaa ctggagctac ttcgtgctca 
aatcacaaaa aaactcaatg gccttgaaaa 
agtgttgaag tccatgttgg agaatctaaa 
agaggaatat tctagtaaag aaaatcaatt 
gaagctggta gatgagatag aggtcctgaa 
ggaaataaag gatagagatt attttgaact 
gggaaaattg gaaactccat tgcaagctga 
tgaagaaatg gagataaaag ttcttggaga 
aagattgcaa gaagaaaagc tggaactgtc 
ggagacgtcc gttctattaa aagatgacct 
aaacattatt ctaaaagaga atattgacac 
tcagctgcag aaaacacagc aagagctaca 
tgacaattgt ccaataactc aagaaaagga 
agaaaagata ttgttattaa ctgaagaatt 
actacatgaa aagaatgaac ttgaacaagc 
tctgatgaag agtatgatcg aatcgaagtc 
tgatactgaa caacaacttc ttgctcttaa 
gaaagagctg caacaaaccc atgaacactt 
tattgaattg ggtttgaatt ttaaaaatga 
tctgctaaat gagaataaag aacttgagca 
agagcttatg aaaagcttaa aggataaaga 
acaaaaagta attaatctga accaagaaat 
gaaaaatagc cagaggactg taattgctga 
gagtgttgaa atgtccattg aaactcaaga 
gcagcagaaa gataaagttc aggaactgac 
ctctcttctg gaaaatcaga tgctttataa 
aagagatgac ctgaaccagt ctaagcaaca 
gtctttaaaa gaaaaggaat ttgcattgga 
caggaaaaca atagatatca cagagaagat 
agccaccaat ttaaaggaaa ctctttatga 
actggctttg aacacagaac accttaggga 
taaaatggag caggagagag atgaagctgc 
gtcttctcta gaagaacaga tcaatgaaaa 
aaaagagacc ttctaccttc agagaccttc 
gctcagagag tctttaaaga ccaaagattt 



ttctgatgct gaagctgtga ctcgcgatgc 2700 
aaatctagag ctgaaggaga aaatggagga 2760 
ggctgcgtct ttgtttgaga agcagctgga 2820 
agctgatttg cagaaagagt tgcaaagtgc 2880 
tctggcagga aaggtcccca gagatttgct 2940 
tgagttctca aagcagcttg agaaagcatt 3000 
gacttgccta tcagaataca aatttttgcc 3060 
cagcaaggct tctgaagaga taatgttatt 3120 
tataagcaaa caagagatta tcatgcagga 3180 
cgaagtgaca cacacacagt caaaagtgca 3240. 
gaaaatgcat gatgatcttt ttgaaaagta 3300 
tttaagagaa atggagaacc ttaaaggcac 3360 
cacaaaacat gaacttgaag aaactataag 3420 
atactttttt caagcaatgc agactatatt 3480 
tccctcaaaa ttagttgaag ggaactctca 3540 
tttaatagcc cttgctacag aaaggaacaa 3600 
cagtctcaag gagcaagtta ttgatttgaa 3660 
tattgaaaag tctgatctcc agaagccaaa 3720 
gcttttggag atggaactac tgaagggaca 3780 
attgcagctg gagaatctgg aagttacaga 3840 
aaatattact atagaaagga atgagcttca 3900 
tgatagccta aaacaagacc ttagtgaaaa 3960 
attaagggct gcccaggaag agctaagaga 4020 
acagctttta gattgttctg taggaatttc 4080 
ggaaaaggtg tcattgggtg aagttaattc 4140 
agatgagctt caaacatctt gtaaggcatt 4200 
tgtaaaatct gtggagggag aaaaccttga 4260 
ggagatattg ggcaaatctg aagaaagcga 4320 
ggaagacaac aataagctca aagaacaagc 4 380 
cagtttagaa gaggtgttca gtggttcaca 44 40 
agcacagcta aaggcagcag aagaaaggct 4500 
tgtacaaact gcaaacacca atttagttga 4560 
ccatgaggaa gacagcattg atcggcgttc 4 620 
aaaacttgag cgaaatcagt atttactgga 4680. 
taacaaactt gaaatccttc agaaagagat 4740 
gcaacagaag ctagaaagct tgctgagtga 4800 
aaccctaaag catcattcag atactcaagc 4860 
gttggctaag aatcttgcaa tcgctgcttc 4 920 
aacctctgca gattgtgtgc atcctctgga 4980 
gcatcaaaaa actaatgaac aggaaaaatt 5040 
tcaggttgag ctaaagtgcg aggtggaaca 5100 
ctcacttgag tccttgcagc atgagaaaca 5160 
acagcagatg caagtagtta ctcaagaaaa 5220 
aacagctgag gtggaccatc taaaagagaa 5280 
agcgcagcaa aagaccacta aagagcaatg 5340 
aagccagcac agacttcaat gtgagataga 5400 
gtcagcgctg gaaactttaa aagaatctga 54 60 
ggaaatggtt atgctggaaa tggaggaatt 5520 
gagggaccag ctgcaagacg acctaaggga 5580 
tgatctaaga aaggctcaag aagcattgca 5640 
ctcccagatt tctgtcctgc aggaaaagat 5700 
tgttgcaact gtgaaagaaa ctctaagcga 57 60 
cctgttctca gaaattgaaa ctcttagtct 5820 
acaagcagag aaggacaaag ctgatgctgc 5880 
atcaaatata gaagaacagt tacttcaaca 5940 
aagagagagt cttatccagt gtaaggagca 6000 
aacattgaag agcaaagact tggcattggg 6060 
caataaagta atagctctta cagaaaagat 6120 
tgttactact ctaaaagaag gtgagggtga 6180 
aaagcagcag tcttcttccc agatggaaga 6240 
gcagttggaa gaggccgaga aggagataag 6300 
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tgaagctact aatgaaataa agaatctcac 
tcttcagaat gctagcattt tgaatgaagc 
gaagcagcaa ctggtttcag aattggagca 
tgcatttgcg caatctaaac gcgaaaagga 
tgaagaaata aagatcctga caaaagagat 
gcaagaacag tcttcccatc taagtgaaga 
gctcaagcaa cagaaagaag acatcaacaa 
tgagctattg caacacttat catctctaaa 
aaggaatgaa aagctcagaa actatgaact 
aatctcagtg ctgcgtttaa tgcagaacga 
acgtatggat atacttgaga gcagaaacca 
cgctgtgtat tcagagcaac acactttgct 
aactgaagca cacaaacatt gcatgttaaa 
cagatccttt ggcagcttgc aaactgagca 
tctgaacaaa tttaaggttg tataccgaac 
caaagattat gagaaggacc ttgctgctga 
actgcagtgt ttggagcagc acggcagaaa 
gttctgtgaa attgaattct tgaatgagtt 
tgtccaggat gacttttcag aggtgcaggt 
agaagagctt gagcacaaga aaggctttat 
cgtcgatgct aagaaactca gtgaaggcat 
catacagctc ttaacaaaaa ggctaaaggc 
aaccgtatat ctgaaccagt -ttgaagcgaa 
acttatgcgc agaatggagc accacggccc 
tagactttta ggcatactga aaactgttca 
caaaatgcta gaaaatgaac tgaacttggt 
agttgcaatt ttgcaagaca aactactaag 
gcaggtgaaa ctaactaaaa agcaagataa* 
cctacagaaa atggttgcca aaggtgcagt 
aactaaggtg gtaaagattg aaatggaaaa 
gattgcctac ttaaagtctt gtttggaaga 
ggaacttagg cgagcacagg cagacaacga 
gaaagcttca actttccctg tgacttgtgg 
aatgcttgtg ctgcagtcgg aaaaagccgc 
gaaatatcat catttatcac ggactatgtc 
aaaatctgat gctcattctt ctcatactgg 
tgaaacttac agacatggcc ctgttactcc 
aggatctcca aagaagtcgg agtccagcac 
aatctacagc caattagtaa tgtctccagg 
tccaagcaag gttggactgc acaaaaagcg 
cacccagcat gtcatatccc ctggcaagac 
gttattcgac aatttgtctt ctccatgcaa 
ccctaaaggc aaattatttg atgtgaagtc 
ttttgataat tctaagcttg gtgatttttc 
aagtcaggct gagaactggt ggtatgaagc 
atcctagatc cctgtacatc tgactctcct 
cttgtaggaa gaaacactag aaactgccat 
aagttgttct ttagtagtaa tcactggttg 
caatacgtct cacgtctatt ttatatgtta 
cttgacttca tatttggctc atctgtagtt 
aaaaatgaca tttgcagtgt attgttttac 
agcaccatta gtgtgccaat gtttttcact 
gaaaatacta caaaagatag ggcaaaagtt 
aacttaatta ttctcctaat aaaatattct 
ggttattctg cactcttgta aacctttttt 
tttaaattaa aaaaaaaacg gaattc 



tgctaagatc tcttctctag aggaggagat 6360 

tgtaagcgaa agggaaaacc ttcgccattc 6420 

gctatcactg acattaaaga gtagagacca 6480 

tgaagctgta aataaaatag ccagtctcgc 6540 

ggatgaattc agagattcaa aggaatcctt 6600 

gttatgtaca tataagactg aacttcaaat 6660 

caaacttgca gagaaagtta aggaagtgga 6720 

ggaacagctg gaccaaatac agatggagct 6780 

ctgcgaaaag atggatatca tggaaaaaga 6840 

gcctcagcag gaagaagatg atgttgcaga 6900 

agaaatacag gagctgatgg aaaaaatctc 6960 

cagcagtctc tctagtgagc ttcaaaagga 7020 

tataaaggaa tctctgtcat ccacgctctc 7080 

tgttaagcta aatactcaac tgcagaccct 7140 

tgctgcagtc aaagaagatc atagcttgat 7200 

gcaaaagagg catgatgagc tgcgactcca 7260 

atggtcggat tctgcatctg aggaactcaa 7320 

actttttaaa aaagcaaata taattcagag 7380 

attcctaaat caagtaggat caacactgca 7440 

•gcagtggttg gaggaatttg gagatctgca 7500 

gcaacaggaa aataggcgca ttgcttctac 7560 

agttgttcag tcgaaaatac aacgtgagat 7 620 

attgcaagag aagaaagaac aaaacaaaga 7680 

tagtgctagt gtaatggagg aagaaaatgc 7740 

agatgaatcc aagaaactcc aatcaaggat 7800 

caaagatgat gccatgcaca aaggtgaaaa 7860 

cagaaacgcc gaagctgagc taaacgcaat 7 920 

tcttcaggct gcaatgaaag aa-atagaaaa 7980 

accatataaa gaagaaattg acaaccttaa 8040 

aataaagtac tcaaaagcaa cagaccaaga .8100 

taaggaagaa ggcttgcgta ggttaaaaga 8160 

tacaacagtt tgtgttccaa aagattatca 8220 

tggtggaagc ggtatagtgc agagcacagc 8280 

cttggaaagg gagctgtcac attataagaa 8340 

aagttctgaa gatcgaaaga aaacaaaggc 8400 

atcatcacac agaggctcgc ctcacaaaac 84 60 

agaaaggtct gaaatgccaa gcctacacct 8520. 

taaacgtgtt gtgtcaccaa acaggtccga 8580 

caagaccggg atgcataaac atatactttc 8 640 

tgccctgtct ccaaacagat cggagatgcc 8700 

cggactgcat aaaaatctaa ctgaaagcac 8760 

acagcaaaaa gtacaggaaa atctaaattc 8820 

aaaatcgatg ccttactgtc catctcagtt 8880 

agagctcaac acagcagaga gcaatgacaa 8 940 

aaaaaaagaa acggcacctg aatgtaaaac 9000 

gtcctgcaaa gagacttgct actctgcctt 9060 

gtctgcataa aggagtctca ctggaagcaa 9120 

ggcgagtggt tacgtcttta aaataaagtg 9180 

tgtctgtgta tttgttacac ttttaagtcc 9240 

tctttgtgtt tgcgcacaca cacactggtg 9300 

tgactggtct ctctggggcc atccatgcaa 9360 

acttattatt atgtctgact ttgtgaaata 9420 

gttttctcac taggtaaacg atatgggttt 94 80 

atataatggc atctcattag gtgacctaat 9540 

aaatttcact cgtaataaag cagccctgat 9600 

9626 
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<210> 3 
<211> 9 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequenced amino acid 
insertion encoded by one cDNA isolated 

<400> 3 

Asn Ser Arg Glu His Ser lie Asn Ala 
1 5 



<210> 4 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : putative 
nuclear localization signal (NLS) 

<400> 4 

Arg Lys Lys Thr Lys 
1 5 



<210> 5 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence -.primer 
<400> 5 

gggctgccca ggaagag 



<210> 6 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 6 

gacagcattg atcggcg 



<210> 7 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
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<400> 7 

gagggttcgg ccgctta 



<210> 8 
<211> 17 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 8 

tctggggcca tccatgc 



<210> 9 
<211> 28 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 9 

catatgacca tggccgaggg agatgcag 

4 

<210> 10 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : primer 
<400> 10 

gtcaggtcag caacatacac g 



<210> 11 
<211> 30 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : sequence linked 
at C-terminus of amino acids 1-473 of XCENP-E in 
plasmid pET23dXCEMycHis 

<220> 

<221> PEPTIDE 

<222> (12) . . (20) 

<223> c-myc epitope tag 

<400> 11 

Thr Val Ser lie Ser Leu Gly Asp Leu Thr Met Glu Gin Lys Leu lie 
15 10 15 



Ser Glu Glu Asp Leu Asn Phe Glu His His His His His His 
20 25 30 



